View-Size Estimation in Self-Organizing Databases

نویسندگان

  • Kyoung-Hwa Kim
  • Rada Chirkova
چکیده

Data-intensive systems routinely use derived data, such as indexes or materialized views, to improve query-evaluation performance. In this context, the problem of dsigning derived data is as follows: Given a database and a set of queries, return definitions of derived data that, when precomputed and stored in the database, would reduce the evaluation costs of the queries. Designing materialized views and indexes is an important part of automated query-performance tuning in data-management systems that experience changes over time, where a system addresses the performance requirements of current frequent and important queries by periodically reconsidering and rematerializing the stored derived data. In this paper we focus on the accuracy of estimating the sizes of views that are considered for materialization. We analyze and experimentally evaluate several size-estimation techniques and suggest guidelines for using the techniques depending on system parameters. We describe experimental results obtained in our prototype self-organizing database system QPET. Our guidelines are also applicable to estimating the sizes of intermediate results in query optimization.

برای دانلود رایگان متن کامل این مقاله و بیش از 32 میلیون مقاله دیگر ابتدا ثبت نام کنید

ثبت نام

اگر عضو سایت هستید لطفا وارد حساب کاربری خود شوید

منابع مشابه

Self-organizing systems for knowledge discovery in large databases

We present a framework in which self-organizing systems can be used to perform change of representation on knowledge discovery problems, to learn from very large databases. Clustering using self-organizing maps is applied to produce multiple, intermediate training targets that are used to define a new supervised learning and mixture estimation problem. The input data is partitioned using a stat...

متن کامل

The Time Adaptive Self Organizing Map for Distribution Estimation

The feature map represented by the set of weight vectors of the basic SOM (Self-Organizing Map) provides a good approximation to the input space from which the sample vectors come. But the timedecreasing learning rate and neighborhood function of the basic SOM algorithm reduce its capability to adapt weights for a varied environment. In dealing with non-stationary input distributions and changi...

متن کامل

Estimation of Missing Daily Precipitation and Runoff Using Self-Organizing Map (A Case Study: Mazandaran Province)

Expert aquatic designers face many problems; among these, in hydrology, defective occurrences in time-series can cause errors in the ultimate results of the study. This more often happens in the regions where the number of hydrometric and rain gauge stations is limited. In addition, assessing, developing and maintaining the use of water resources require accessible long-term and high-quality qu...

متن کامل

A Modfied Self-organizing Map Neural Network to Recognize Multi-font Printed Persian Numerals (RESEARCH NOTE)

This paper proposes a new method to distinguish the printed digits, regardless of font and size, using neural networks.Unlike our proposed method, existing neural network based techniques are only able to recognize the trained fonts. These methods need a large database containing digits in various fonts. New fonts are often introduced to the public, which may not be truly recognized by the Opti...

متن کامل

Self-Organizing Databases: Near-Optimal Query Performance at all Times Using Flexible Views

Project Summary: The goal of this project is to develop new effective methods to improve the performance of sets of frequent and important queries on large relational databases at all times, which could improve the efficiency of user interactions with data-management systems. Solving the problem will have the most effect in query optimization, data warehousing, and information integration, whic...

متن کامل

ذخیره در منابع من


  با ذخیره ی این منبع در منابع من، دسترسی به آن را برای استفاده های بعدی آسان تر کنید

عنوان ژورنال:

دوره   شماره 

صفحات  -

تاریخ انتشار 2005